Trellis encoded vector quantization for robust speech recognition

Authors

  • Wu Chou
  • Nambi Seshadri
  • Mazin G. Rahim
Abstract

In this paper, a joint data (features) and channel (bias) estimation framework for robust speech recognition is described. A trellis encoded vector quantizer is used as a pre-processor to estimate the channel bias using blind maximum likelihood sequence estimation. The sequential constraint in the feature vector sequence is explored and used in two ways: (a) the selection of the quantized signal constellation, and (b) the decoding process in joint data and channel estimation. A two-state trellis encoded vector quantizer is designed for signal bias removal applications. Compared with the conventional memoryless VQ-based approach to signal bias removal, the preliminary experimental results indicate that incorporating the sequential constraint in joint data and channel estimation for robust speech recognition is advantageous.
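
The contrast between the memoryless and the trellis-constrained approach can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not specify the trellis, the codebook partition, or the bias update rule. The sketch assumes an additive bias in the feature domain, a simple iterate-and-average bias update, and a hypothetical `allowed` table that maps each state transition to the codeword indices permitted on it. All function names are illustrative.

```python
import numpy as np

def squared_distances(frames, codebook):
    """Return a (T, K) matrix of squared distances from frames to codewords."""
    diff = frames[:, None, :] - codebook[None, :, :]
    return (diff ** 2).sum(axis=2)

def memoryless_assign(frames, codebook):
    """Memoryless VQ: pick the nearest codeword for each frame independently."""
    return squared_distances(frames, codebook).argmin(axis=1)

def trellis_assign(frames, codebook, allowed):
    """Viterbi search over a trellis (assumed structure, see lead-in).

    allowed[s][s2] is the list of codeword indices usable on the
    transition from state s to state s2 (hypothetical partition).
    """
    n_states = len(allowed)
    d = squared_distances(frames, codebook)
    cost = np.zeros(n_states)  # any start state is allowed at zero cost
    back = []                  # per frame: (previous state, codeword) per state
    for t in range(len(frames)):
        new_cost = np.full(n_states, np.inf)
        choice = [None] * n_states
        for s in range(n_states):
            for s2 in range(n_states):
                candidates = allowed[s][s2]
                if not candidates:
                    continue
                k = min(candidates, key=lambda i: d[t, i])
                c = cost[s] + d[t, k]
                if c < new_cost[s2]:
                    new_cost[s2] = c
                    choice[s2] = (s, k)
        back.append(choice)
        cost = new_cost
    # Trace the best path back from the cheapest final state.
    s = int(cost.argmin())
    path = []
    for choice in reversed(back):
        s, k = choice[s]
        path.append(k)
    return np.array(path[::-1])

def estimate_bias(frames, codebook, assign, n_iter=10):
    """Alternate between decoding the frames and re-estimating the bias."""
    bias = np.zeros(frames.shape[1])
    for _ in range(n_iter):
        idx = assign(frames - bias, codebook)
        bias = (frames - codebook[idx]).mean(axis=0)
    return bias
```

To use the trellis decoder with `estimate_bias`, wrap it so that it takes the same two arguments as the memoryless one, for example `lambda f, c: trellis_assign(f, c, allowed)`. The only difference between the two estimators is the decoding step: the memoryless version quantizes each frame on its own, while the trellis version restricts the codeword sequence to paths the trellis permits.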

Similar resources

Low Bit Rate Speech Coding via TCVRQ

We present a new Trellis Coded Vector Residual Quantizer (TCVRQ) that combines trellis coding and vector residual quantization. We introduce new methods for computing quantization levels and experimentally analyze the performances of our TCVRQ in the case of speech coding at very low bit rates. The results obtained show that transparent quantization of Linear Prediction (LP) parameters can be p...

Quantization of LSF parameters using a trellis modeling

An efficient Block-based Trellis Quantization (BTQ) scheme is proposed for the quantization of the Line Spectral Frequencies (LSF) in speech coding applications. The scheme is based on modeling the LSF intraframe dependencies with a trellis structure. The ordering property and the fact that LSF parameters are bounded within a range are explicitly incorporated in the trellis model. BTQ sea...

Block Constrained Trellis Coded Vector Quantization of LSF Parameters for Wideband Speech Codecs

ETRI Journal, Volume 30, Number 5, October 2008. Abstract: In this paper, block constrained trellis coded vector quantization (BC-TCVQ) is presented for quantizing the line spectrum frequency parameters of the wideband speech codec. Both a predictive structure and a safety-net concept are combined into BC-TCVQ to develop the predictive BC-TCVQ. The performance of this quantization is compared wit...

Low-Delay Wideband Speech Coding Using a New Frequency Domain Approach

In this paper, a new frequency domain approach suitable for low-delay wideband speech coding is proposed. Working in the context of residual speech coders, the proposed technique performs a decomposition of the DFT of the target vector (the input speech after the subtraction of the zero input response signal) as the product of the DFT of the impulse response of the LPC synthesis filter and a ...

Improving the performance of MFCC for Persian robust speech recognition

The Mel frequency cepstral coefficients (MFCC) are the most widely used features in speech recognition, but they are very sensitive to noise. In this paper, to achieve satisfactory performance in Automatic Speech Recognition (ASR) applications, we introduce a new noise-robust set of MFCC vectors estimated through the following steps. First, spectral mean normalization is a pre-processing which applies to t...

Publication date: 1996